A signal-to-noise analysis of phylogeny estimation by neighbor-joining: Insufficiency of polynomial length sequences.
نویسندگان
چکیده
Phylogeny reconstruction is the process of inferring evolutionary relationships from molecular sequences, and methods that are expected to accurately reconstruct trees from sequences of reasonable length are highly desirable. To formalize this concept, the property of fast-convergence has been introduced to describe phylogeny reconstruction methods that, with high probability, recover the true tree from sequences that grow polynomially in the number of taxa n. While provably fast-converging methods have been developed, the neighbor-joining (NJ) algorithm of Saitou and Nei remains one of the most popular methods used in practice. This algorithm is known to converge for sequences that are exponential in n, but no lower bound for its convergence rate has been established. To address this theoretical question, we analyze the performance of the NJ algorithm on a type of phylogeny known as a 'caterpillar tree'. We find that, for sequences of polynomial length in the number of taxa n, the variability of the NJ criterion is sufficiently high that the algorithm is likely to fail even in the first step of the phylogeny reconstruction process, regardless of the degree of polynomial considered. This result demonstrates that, for general n-taxa trees, the exponential bound cannot be improved.
منابع مشابه
Analysis of mitochondrial DNA sequences of Turcinoemacheilus genus (Nemacheilidae Cypriniformes) in Iran
Members of Nemacheilidae Family, Turcinoemacheilus genus were subjected to molecular phylogenetic analysis in this study. This genus was reported in 2009 to inhabit in Karoon River drainage, in contrary to previous assumption that it was the endemic species in the Basin of Tigris River. It was sampled from three stations placed in different tributaries in Karoon drainage and evaluated to unders...
متن کاملMolecular Characterization and Phylogeny Analysis Based on Sequences of Cytochrome Oxidase gene From Hemiscorpius lepturus of Iran
Abstract: Background: Hemiscorpius lepturus is a medically important scorpion found along the Iranian borders, especially near to Khuzestan Province in the south-west of Iran. This is the only non-buthid scorpion which is potentially lethal in southern Iran and is responsible for severe dermonecrotic scorpionism. OBJECTIVES: In this study, DNA fragment of the mitochondrial cytochrome c oxidase ...
متن کاملPhylogeny of Phytophthora and Phytopythium species associated with rice in Fars province (Iran)
In order to investigate the Oomyceteous species of the rice paddy fields of Fars province (Iran), during 2013–15, infected roots and crowns together with soil around seedlings and irrigation water were sampled. Based on the morphological, morphometric and physiological studies along with phylogenetic analyses of internal transcribed spacer sequences based on neighbor joining method, two Phytoph...
متن کاملA phylogeny analysis on six mullet species (Teleosti: Mugillidae) using PCR-sequencing method
In this study, genetic differences and phylogenic relationships among six Mugilidae species (Mugil cephalus, M. capito, Liza subviridis, L. saliens, L. aurata, Valamugil buchanani) were determined using PCR-sequencing. M. cephalus, L. subviridis, and V. buchanani from the Persian Gulf and Oman Sea, and L. aurata and L. saliens from the Caspian Sea were col-lected. Samples of an imported, Egypt...
متن کاملPhylogeny of urate oxidase producing bacteria: on the basis of gene sequences of 16S rRNA and uricase protein
Uricase or Urate oxidase (urate:oxygen oxidoreductase, EC 1.7.3.3), a peroxisomal enzyme which is found in many bacteria, catalyzes the oxidative opening of the purine ring of urate to yield allantoin, carbon dioxide, and hydrogen peroxide. In this study, the phylogeny of urate oxidase (uricase) producing bacteria was studied based on gene sequences of 16S rRNA and uricase protein. Repres...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
عنوان ژورنال:
- Mathematical biosciences
دوره 199 2 شماره
صفحات -
تاریخ انتشار 2006